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Abstract 

^H , 

O ; Rapid .dentmcation of object from radar cross sect.on (RCS) signals is importar^t for many space and ..l.tary appl.cat.ons. 

, ' This identification is a problem in pattern recognition which either neural networks or support vector machines should prove to 

C^ ' be high-speed. Bayesian networks would also provide value but require significant preprocessing of the signals. In this paper, we 

describe the use of a support vector machine for object identification from synthesized RCS data. Our best results are from data 

j-y-j . fusion of X-band and S-band signals, where we obtained 99.4%, 95.3%, 100% and 95.6% correct identification for cylinders, 

Cn , frusta, spheres, and polygons, respectively. We also compare our results with a Bayesian approach and show that the SVM is 

three orders of magnitude faster, as measured by the number of floating point operations. 

a 

> 

^^ keep our problem simple and to allow us to explore a new classification algorithm, we will focus on four types of objects; 

l/~) cylinders, frusta, spheres, and irregular flat polygons, ranging in size from centimeters to meters. 

^iJ ', Geometric shapes of exo-atmospheric objects can be determined by analysis of the returned radar cross section (RCS) 

^D . signatures H]. Though some work has been done on computing the geometry from RCS |2|, the approach is generally not 



I. Introduction 

Classification of dynamic radar signals from exo-atmospheric objects is an important problem for military and space 
applications. Obviously different shapes and sizes can have significantly different meaning. Pebble-sized objects have little 
direct military interest as potential missiles. However, pebble-sized objects can be of great interest for space craft survival. To 



effective. Because the RCS signatures are unique patterns for different geometric types, this suggests using machine learning 
techniques for pattern recognition. A vast literature exists for pattern recognition of many types of signals, signatures, and 



H I images. Some of the most widely used are neural networks, Bayesian networks, and support vector machines (SVM). To use 
these pattern recognition systems generally requires a database of pairs of signatures and classes. In our case this would be a 
database of synthesized RCS returns from representative frusta, cylinders, spheres, and polygons. 

Machine learning techniques generally require two stages: one for training a system to learn how to separate geometric 
classes and a second for classifying new data as it arrives. The data used for our studies are simulated RCS returns from 
multiple geometric shapes. Accuracy is represented as an average of the percent correct, while precision is represented by 
measuring the standard deviation from a batch of runs consisting of training foUowed by testing and re-randomizing. 

Depending on the desired application, we may require pattern recognition from RCS signatures be done quickly so that 
decisions for dealing with this information can be made in a timely manner This requires that the computational burden be 
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minimized while maintaining the highest degree of accuracy. The three common pattern recognition methods mentioned above 
could potentially be fast enough for these applications. Each approach has its own advantages and disadvantages. 

All three methods require a database of labeled signatures. Generally, neural networks require at least three times as many 
data samples (preferably ten times as many) for training as the input dimensionality. So if our RCS signatures consist of 1,000 
data points, then we need 10,000 labeled signatures. This is potentially not a problem for synthesized databases. However, as 
the input dimensionality increases, the number of samples goes up. In many cases this is, or will quickly become, a problem. 
During training of the neural network, there is invariably a good chance the learning algorithm will converge to a local 
minimum in the nonlinear hyperspace of the weights. This can result in poor generalization or poor performance on testing 
and evaluation samples. Neural networks could be very fast for RCS pattern recognition because once the network is trained, 
the evaluation simply consists of a few vector-matrix multiplications and some function applications to vectors. Specifically, 
if we used a single hidden layer perceptron with 1,000 inputs and ten nodes in the hidden layer and one output, a new RCS 
classification would consist of multiplication of one 1000-element vector with a 1,000 x 10 element matrix to produce a 
ten-element vector to which we then apply a hyperbolic tangent for each element, followed by another vector-matrix product, 
in this case a ten-element vector by a 10 x 1 element matrix. The total would be on the order of 10 MFLOPS. Clearly the neural 
network could provide good speed, but the performance is not adequate for most applications, especially applications requiring 
significant accuracy. Neural networks also have the advantage that significant preprocessing or feature extraction of the RCS 
signatures may not necessarily be required. Farhat ID conducted early research on using neural networks for automated target 
identification, but not from RCS. His approach was more concerned with image reconstruction from microwave signatures and 
image recognition. At the image recognition step, he applied the neural networks. A good reference of neural networks and 
their use is Q. 

An alternative pattern recognition method that could be used for our RCS analysis is a Bayesian network ||3]. The Bayesian 
approach also requires a massive amount of data for training. It also requires pre-computation of the moments of the distributions 
of the data. This would necessarily require a, perhaps massive, reduction in dimensionaUty to apply Bayesian techniques to 
spectral identification and RCS signature classification. The approach typically involves extracting some features from the 
signatures or spectra and using that population of features for the computation by the Bayesian methods. As we will see, the 
pre-calculation of features often is quite time-consuming. 

This paper reports our use of support vector machine (SVM) ISJlQ. Similar to the other methods, it requires a database 
of labeled RCS signatures. The SVM has a number of distinct advantages over the other two techniques. Unlike the neural 
network, it is not too likely that the training will get stuck in some nonlinear space, because the data are first transformed to a 
linear space, albeit, perhaps of very high-dimensionality. Consequently there is a global minimum and the performance results 
can be far better than a neural network. Secondly, unlike the Bayesian network the SVM can accept very high-dimensional 
input vectors without preprocessing. 

In the following sections we discuss construction of an RCS database for frusta, spheres, cylinders and polygons. We then 
discuss an SVM for RCS signature classification and compare the results with a Bayesian approach. 

II. RCS Data Synthesis 

Three synthetic data sets were used to train and test the SVM. The data sets ranged from simple radar cross section (RCS) 
versus viewing aspect angle tables to complex simulations of scenarios incorporating multiple radars operating at different 
frequencies and locations. Most of the results presented in this paper are based upon a group of data sets of intermediate 

UNCLASSIFIED 



UNCLASSIFIED 

complexity, incorporating RCS versus viewing aspect for four object shapes of varying size and viewing aspect angle at two 
commonly used radar wavelengths, 3-GHz (S-band) and 10-GHz (X-band). The four shapes evaluated are: 

1) Spheres with radii varying from 0.001-2 m. 

2) Cylinders with diameters from 0.5-2 m and lengths from 1-12 times the diameter (1-20 m range). 

3) Frusta, blunt nosed cones, with heights (the distance from the nose to the tail) of 0.5-2 m, tail diameters from 25-100% 
the height, and nose diameters of 5-30% the tail diameter. 

4) Flat plate polygons with 3 to 5 edges with a maximum feature size of 0.3-6 m. 

A. Radar Cross Section 

The RCS, cr(A, r), for each of these objects is a function of frequency. A, and, for all objects except for the sphere, viewing 
aspect angle 9. The RCS models for the objects were drawn from a number of sources and each is unique. The RCS model 
for the sphere was adapted from [SI and includes three regions: 
Optical 

a = TTr^, r>A (1) 

Rayleigh 

2tt 
a « 97rr^(fcr)^ : where fc = — and r <C A (2) 

A 

and the Mie region, where the RCS is approximated using spherical Bessel functions. 

The RCS model of a right cylinder used in this study is 



(T = kal 



cos ft,- ^ ' 



(3) 



kl sin 6i 

where a is the radius of the cylinder, I is the cylinder length, and 6i is the angle from broadside incidence. This solution is 
derived in Knott |[T| using physical optics. Because this solution becomes singular when 9i = 0, we substitute in the exact 
broadside solution 

a = kaP (4) 

when ^i = 90 ± 0.2°. Because equation |3] does not account for the cylinder end caps, we employ the theory of superposition 
and add the RCS of a circular plate 



<ycap = TTk'a^ ( ""^v— ■--— / j (cQgff)^^ ^ (5) 



,2 4 /2Ji(2A:asin6; ^ 2 

^ * 2kasm9 



and 

^ = -^'^ = 0' (6) 

where 9 = corresponds to normal incidence (SJ. Figure [1] shows both the S and X band RCS versus aspect angle for a 0.5 m 
by 5 m long cylinder 

For the frustum, we elected to use a classic set of formulas from 1*91, which utilizes the geometric theory of diffraction to 

predict the RCS of a frustum using 4 scattering centers: 

4 
^e^P = J2 V^e^"' . (7) 

Figure |2] shows the locations of the four scattering centers and the key dimensions defining the frustum model. Assuming that 
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RCS vs Aspect angle for a 0.5 m radius x 5 m long cyclinder 
Top S-Band, Bottom X-Band 
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Fig. 1. RCS versus aspect angle for a 0.5 m by 5 m long cylinder at both (a) S-band and (b) X-band. 



the radar is monostatic, the contribution of the 4 scatterers is 



cr4 = < 
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(8) 



(9) 



(10) 



(11) 



with 



51 = 



52 



sin(7r/r;i) /oicsc 



??i 



sin(7r/7]2) ja2csc9 



mi = cos(7r/77i) 

7772 = COs(7r/772) 



(12) 

(13) 
(14) 
(15) 
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Fig. 2. Scattering centers geometry for frustum RCS. The axis is of rotation is normal to the plane of the paper, centered at the origin of the dashed axes. 



and the phases referenced to the base of the frustum are given by 

pi = -2fc[ai sin 6* + 2/1008 6*] +7r/2 

P2 = —2ka2Sm9 + Tr/4: 

P3 = 2fc[aisin0 - 2 008 6*] - 7r/4 

P4 = 2ka2 8in 6 — 7r/4 



where a is the frustum angle in radians 



and 



a = tan 

V2 = 



a2 — ai 
2h 



TT 

a 



(16) 
(17) 
(18) 
(19) 

(20) 

(21) 
(22) 



2 TT 

and where 9 is the viewing aspect as measure along the axis of symmetry from the large end of the frustum (02). The choice of 
signs in equations lISl- lfTTI relate to the polarization convention (upper sign for vertical polarization, lower sign for horizontal 
polarization). For this analysis, we assumed vertical polarization. As is the case for the cylinder, this solution has singularities 
at 6* = 0, 7r/2 — a, vr. These singularities have been handled as described in (]9l- Figure [3] shows both the S- and X-band RCS 
versus aspect angle for a frustum where di = 0.14 m, d2 = 0.72 m, and h = 1.25 m, as shown in Figure [3] 

Lastly for the A^-sided polygon, we applied a physical optics solution from IS), where the expression for an arbitrary A^-sided 
polygon in a local coordinate frame has been evaluated analytically as 

hx an- Ctn-l 



N 



S{u,v) = ^i 



J('^--r) 



{U) ■ OLn){u} ■ OLn-l) 



(23) 
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RCS vs Aspect angle for ad =0.14 m, d =0.72 m, h=1.25 m Frustum 
Top S-Band, Bottom X-Band 
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Fig. 3. RCS versus aspect angle for a frustum at both (a) S- and (b) X-bands. 



where 7„ are the polygon vertices and a„ are the edge vectors given by 

7n+l ^ In 



Oin = 



(24) 



l7„+i-7„l 

and LJ = ux + vy. Figure |4] shows a randomly generated five-sided polygon, typical of those generated for use in this study. 
Figure |5] shows the calculated RCS for the polygon shown in Figure |4] when it is rotated about the axis shown in Figure |4] 
(6 = 0, 180° represent the edge on condition). 



B. Training Sets 

Now that we have defined the objects that are going to be incorporated into data sets we need to use these models to 
generate simulated radar data. Ideally we would like to train the SVM using data similar to what would be observed in an 
actual engagement but for simplicity we started with a scenario similar to what one would encounter on a radar test range. The 
first set of data consisted of 10,000 randomly generated frusta and cylinders whose RCS versus aspect angle were calculated 
using 2,000 evenly spaced samples from 0-180°. 

Once the performance of the SVM had been verified using this data set, we increased the difficulty of the problem by varying 
the aspect through which each object was rotated. The second data set consisted of randomly generated frusta, cylinders, spheres, 
and polygons rotated through an aspect starting at 0° and ending at 180° (the RCS was calculated at 2,000 evenly distributed 
points through this rotation). There were 5,000 data vectors for each object type, resulting in a total of 20,000 data vectors. 
The final data set consisted of 20,000 randomly generated frusta, cylinders, spheres, and polygons rotated from a random start 
angle with a minimum rotation of 180° and a maximum rotation of 1080°. Figure |6] shows the RCS versus sample for six 
objects in the final data set, with sample being the sampled aspect angle. 
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Fig. 4. A typical five sided polygon. 



Summarizing, our data consisted of three data sets. The first data set comprised 4930 cylinders and 5070 frusta for a total 
of 10,000 objects, approximately a 50% distribution between the two classes. The data for each object consisted of 2,000 RCS 
values, starting at broad-side specular to an observing monostatic radar and rotating one-half revolution. The axis of rotation 
was normal both to the axis of symmetry of the object and the direction of observer. The frequency was selected at random 
from one of the following four frequencies: 900 MHz, 3 GHz, 6 GHz, and 10 GHz, simulating four different viewers. The 
RCS can be obtained from a radar return given an object's range and the gain from the radar station, and varying it across 
azimuth simulates the changing perspective an observing ground radar would have of dynamic objects. 

The second data set comprised objects from four different classes: cylinders, frusta, spheres, and random polygonal plates. 
The data for each object consisted of 2,000 radar cross-section (RCS) values for each object at a 0°-placement relative to an 
observing mono-static radar and rotating at a random rate with revolution over the 2,000 values ranging between one and five 
periods. Five thousand objects were generated in each of the four classes, with varying lengths and radii, resulting in 20,000 
total objects. Again the data set consisted of the four frequencies for data set 1. 

The third data set replicated the second data set, except that the starting angle to each observer was chosen randomly. Also, 
the returns for each object at S-band (3 GHz) and X-band (10 GHz) frequencies, effectively simulated the returns from two 
separate ground radars observing the same object. This effectively doubles the amount of data for each object, i.e., the length of 
each vector. Our goal was to combine the two radar signatures, via sensor fusion, to improve classification and identification. 
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RCS vs Rotation angle about the Y axis for a 5 sided polygon 
Top S-Band, Bottom X-Band 
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Fig. 5. RCS versus aspect angle for the polygon shown in Figure |4] for (a) S-band and (b) X-band. Axis of rotation shown in Figure |4] 

III. Background to Support Vector Machines 

Having synthesized the RCS data, we are ready to begin the processing with the support vector machine (SVM). The 
following theoretical outline is similar to Vapnik jT). Given our set of labeled training pairs of RCS signatures and class labels 
for what they represent we can write this as follows: {xx,yi), (0:2,2/2), (3^3,2/3), ■ • -, where Xi represents the RCS signature 
and Hi represents the class label (sphere, frustum, cylinder, polygon). To simplify the problem, we will label an RCS signature 
as +1 if it represents a cylinder and —1 if it represents anything else. We thus partition our data set into four labeled groups, 
depending on the object type. As described above in Section |II] each object-type data set consists of 5,000 RCS signatures. 
We can now form a separating hyperplane as follows: 



w — Xi + b > 1 if Hi = 1 
w — Xi + b < 1 if Hi = —\ 

where we see the output classes are —1 and +1. The inequalities in the above equation can be written as: 

yi{w ■ Xi+b)>l, i = 1,. . .,n 
for n training samples. The relations in this equation become constraints for the optimal hyperplane given by 

Wo ■ X + b — 



(25) 



(26) 



(27) 



where & is a constant known as the bias. This is the unique hyperplane for maximum separation of the training data. The vectors 
(RCS signatures) for which the left-hand-side is equal to 1 are called the support vectors. The optimal separating hyperplane 
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Fig. 6. RCS versus sample (aspect angle) for six objects in the final data set. From upper-left to lower-right, moving left-to-right: (a) frustum, (b) cylinder, 
(c) frustum, (d) polygon, (e) cylinder, and (f) sphere. 



is: 

where a° > 0. If we let Aq = (a°, 
maximize 

subject to the constraints 



Wo 



— / ^ ViC^i^i 



(28) 



, a°) then we could solve the quadratic programming problem. Basically we want to 
1 " 



w{A,6)^A'l 



A^DA 



cA^Y = 
(5 > 
< A <5u 

where Y^ = (yi , . . . , j/„) is a vector of labels; u^ is a unit vector; D is an n x n matrix with elements given by Di 



(29) 

(30) 
(31) 
(32) 



ViVj^^ 



with i, j = (1, . . . , n); and amax = max(Q:i, . . . , a„) are the weights for the support vectors and c is a constant analogous to 
bias. The classifier function is now 



f{x) — w ■ (j){x) + b 

n 
i=l 

The values 0(xi) are the transformed objects in feature space. We can rewrite the function as 

f{x) = (f){xi) -w + b 

= y^^yiai<p{xi) ■ (f){xi) + b 



(33) 
(34) 



(35) 
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or more compactly as 

n 

f{x) = ^yiaiK{x,Xi) (36) 

This kernel representation is the conventional way of representing a support vector machine. As pointed out above, we could 
find the support vector weights, ai, by quadratic programming. Instead, we will use the perceptron algorithm when combined 
with an SVM, also known as a kernel adatron. The algorithm is basically a gradient descent in error space to find the support 
vectors. Veropoulus [10| is a good reference for the kernel adatron method. 

There are several available methods for computing kernels for SVMs. These include the linear kernel, the polynomial kernel, 
the radial basis function (RBF) kernel, and the hyperbolic tangent kernel. In this analysis, only the linear kernel was used. The 
linear kernel is computed on the L x N matrix containing the training vectors as a covariance matrix calculation, K = LL^ 
where K is a symmetric, positive definite matrix. The resulting kernel can then be normalized by: 



D = y/diag{K) (37) 

K = DKD (38) 

Once K has been computed, a gradient descent process is used to determine whether or not each vector is a support vector 
S support vectors are selected from the initial L training vectors. After the learning process converges the location of the non- 
zero elements in the a vector, an L-dimensional vector, are pointers to the support vectors. The actual value of the non-zero 
elements in this o; vector are the coefficients for the testing phase. 

A vector under test, in this case an RCS signature, v, can be classified by computing the dot-product of v with each support 
vector, s, scaling each result by a and the response, y, of each support vector. The responses for the support vector are +1 

for positive cases and —1 for negative cases. The vector under test is evaluated according to: 

s 
z = 2_, ^jVj'*^ ■ ^ (39) 

If z > 0, the object under test is classified as belonging in the set; otherwise it is classified as being outside the set. 

IV. Training and Testing 

From the 10,000 objects in the first data set, two trials of training took place: one with 400 randomly selected objects, with 
testing on the remaining 9,600; and one with 4,000 randomly selected objects, with testing on the remaining 6,000. All tests 
were repeated ten times (ten-fold cross-validation) to obtain statistical measurement for accuracy and precision. 

From the 20,000 objects in the second set, 4,000 were randomly selected for training the kernel adatron. The remaining 
16,000 objects were then tested against the support vectors and the weight vector, a.. Tests were performed for each of these 
four classes using a separately-trained SVM classifier: 

• Cylinder vs. non-cylinder 

• Frustum vs. non-frustum 

• Sphere vs. non-sphere 

• Polygons vs. non-polygons 
All tests were repeated ten times. 

From the 20,000 objects in the third data set, 8,000 were randomly selected for training the kernel. The remaining 12,000 
objects were then tested against that kernel. Four tests were performed for each of the classes, as in the second data set. All 
the tests were repeated twenty times (twenty-fold cross-validation). 
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We used the kernel adatron algorithm presented on pages 80-81 in IfTOll . with fixed values of 77 = 0.01, r = 500, and a 
threshold of 5,000. These values affect the selection of the support vectors, and thus the classification results. However, they 
remained fixed for the purpose of these numerical trials. Table |T] describes each experiment performed, including the data set 
used, the number of vectors used for training and testing, the length these each vectors compared to the ones in that data set, 
and other descriptive information. 



exp. 


data 


vectors 


vectors 


vector 


brief 




no. 


set 


trained 


tested 


length 


description 


1 




400 


9600 


full 






2 




4000 


6000 


full 






3 




400 


9600 


quarter 






4 




4000 


6000 


quarter 






5 




4000 


6000 


quarter 


FFT 




6 




4000 


6000 


quarter 


FFT-w 




7 


2 


4000 


16000 


full 


FFTdc, 


1-10 rot., random 


8 


2 


4000 


16000 


quarter 


FFTdc, 


finst quarter 


9 


2 


4000 


16000 


quarter 


FFTdc, 


second quarter 


10 


2 


4000 


16000 


quailer 


FFTdc, 


third quarter 


11 


2 


4000 


16000 


quarter 


FFTdc, 


fourth quarter 


12 


3 


8000 


12000 


full 


FFTdc, 


X-band, S-band 


13 


3 


8000 


12000 


quarter 


FFTdc, 


X-band, S-band, random 


14 


3 


8000 


12000 


quarter 


FFTdc, 


X-band. S-band, random. 












moments 



TABLE I 
Summary of the experiments (see text for more details). 



A. First Data Set 

We tested only for cylinders, so the non-cylinder class is equivalent to the frustum class because there are only two classes 
in this data set. Testing was performed in a variety of ways. First, the raw RCS over the varying azimuths was used, with 
the full 2,000 samples per object. Training on 400 random objects and testing on 9600 objects resulted in 99.4% ± 0.9%, 
decomposed as shown in experiment 1 of Table [III Training on 4,000 random objects and testing on 6,000 random objects 
resulted in 100.0% ± 0.0%, decomposed as shown in experiment 2 of Table HIl 



exp. 


cylinder 


frustum 


total correct 


number 


/x 0- 


^ 0- 


fi a 


1 


49.1 (1.4) 


50.3 (1.3) 


99.4 (0.9) 


2 


49.3 (0.0) 


50.7 (0.0) 


100.0 (0.0) 


3 


45.9 (12.2) 


49.7 (3.1) 


95.6 (5.7) 


4 


47.6 (3.8) 


50.2 (4.3) 


97.8 (2.5) 


5 


45.7 (6.4) 


50.1 (0.6) 


95.9 (2.9) 


6 


47.1 (4.2) 


48.7 (8.4) 


95.6 (3.5) 



TABLE II 

The result of the experiments, where /^ is the mean correct and a is the standard deviation when run multiple times (see text for 

MORE details). ALL UNITS ARE PERCENTAGES. 
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exp. 


cylinder 


frustum 


sphere 


polygon 


number 


^ a 


t^ o- 


fj. a 


^l a- 


7 


99.6 (0.3) 


93.2 (7.7) 


100 (0.0) 


83.8 (14.2) 


8 


76.8 (5.2) 


74.8 (7.7) 


100 (0.0) 


76.5 (22.4) 


9 


93.8 (2.9) 


73.0 (9.7) 


95.5 (9.5) 


79.5 (16.6) 


10 


91.4 (3.4) 


84.3 (7.1) 


100 (0.0) 


85.0 (4.9) 


11 


91.9 (3.1) 


87.7 (3.0) 


100 (0.0) 


72.1 (16.1) 


12 


99.6 (0.4) 


84.9 (2.6) 


100 (0.0) 


88.3 (2.4) 


13 


91.8 (2.1) 


78.2 (9.6) 


100 (0.0) 


85.8 (3.1) 


14 


99.4 (0.8) 


95.3 (3.3) 


100 (0.0) 


95.6 (4.1) 



TABLE III 
The result of the experiments, where /^ is the mean correct and a is the standard deviation when run multiple times (see text for 

MORE details). ALL UNITS ARE PERCENTAGES, AND REFLECT OBJECTS CORRECTLY IDENTIFIED IN INDEPENDENT TESTS. 



When using just one-quarter of the 2,000 samples per object, corresponding to 45° of revolution, results could be httle 
better than a random guess (50%) depending on the swath visible to the observer; these results are not included in Table |ll] 
At this point, testing began on data processed by a Fourier transform instead of the raw azimuthal RCS values. Only the 
magnitude values of the data in the were used. The data were normalized by the magnitude of the DC component (the first 
value). Repeating the above tests resulted in 95.6% ± 5.7% with 400 randomly selected training objects and 9600 testing 
objects, decomposed as shown in experiment 3 of Table |II] Training over 4,000 objects and testing on 6,000 objects resulted 
in 97.8% ± 2.5%, with the results shown in experiment 4 of Table HIl 

These results compare well to using the raw RCS values. When only a quarter of the RCS data were selected, results match 
those of the raw RCS, which depend on the visible swath of data to contain distinguishing characteristics. However, an SVM 
is dependent on data being placed in the same place in the data vector Because we will not know the aspect angle to the 
observer, raw RCS returns are not as useful. Fourier-transformed data places information in the same feature bins of each data 
vector, and this now removes this issue for SVM training. 

When a Kaiser window was applied to the data, the results do not change significantly. Using 4,000 randomly-selected 
training samples, following a Fourier transform, of objects and testing on the remaining 6,000 objects, the results without the 
window were 95.9% ± 2.9%, as shown in experiment 5 of Table HH while with the window, the results were 95.6% ± 3.5%, 
decomposed as shown in experiment 6 of Table HIl (marked as FFT-w in TableH]). This shows that windowing did not significantly 
affect the results. 

B. Second Data Set 

Testing with this set now shows results when the amount of rotation is not fixed to one revolution but instead corresponds 
to a random number of full revolutions (marked as 1-10 rot. in Table IJi, up to ten, for each object. Now that we have four 
objects with equal populations in the data set, a random guess can be considered 75% accurate because stating everything is 
not in the class would be correct 3/4 of the time. All testing is performed on data following a Fourier transform, with the DC 
component (marked as FFTdc in Table |I| normalized to unity. 

Using the full-length 2,000-point RCS data, training on a randomly-selected 4,000 objects, and testing on the remaining 
6,000 objects, the results were those shown in experiments 7 of Table Hill The SVM correctly identified the objects as cylinders 
99.6% of the time, frusta 93.2% of the time, spheres 100% of the time, and polygons 83.8% of the time. Tests using the second 
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and third data sets used the four separately-trained SVM classifiers. 

Training on consecutive, one-quarter-length swaths and testing with one-quarter-length data swaths, gave the results shown 
in experiments 8-11 of Table HiH The results from these tests make sense. Keeping in mind that the starting point for all 
objects in this data set is at 0°, which is nose-on to the observer while 90° is broadside, and the objects each rotate at random 
speeds, the first quarter contains little useful information to distinguish a cylinder from a frustum. Compare the RCS returns 
of Figure [T] and Figure [3] for aspect angles 0° - 45° to see the difficulty in coiTectly classifying objects using only the first 
quarter of the data. Spheres are always easy to distinguish given the uniform RCS returns at every aspect angle. Polygons are 
expected to be difficult to distinguish given they exhibit different RCS return behavior with their varying sizes and number of 
sides. Proceeding from quarter to quarter, the chance of including useful RCS information increases, albeit not linearly. The 
third data set corrects this by starting at a random aspect angle to the observer. 

C. Third Data Set 

All the data in the third data set was processed by a Fourier transform and normalized to unity DC. Each object has two sets 
concatenated together, one for S-band returns and one for X-band returns. The third data set trains on 8,000 randomly selected 
objects. Using 2,000 RCS data points from both ground observers resulted in what is shown in experiment 12 of Table HU] 
The objects were identified correctly as cylinders 99.6% of the time, frusta 84.9% of the time, spheres 100% of the time, and 
polygons 88.3% of the time. 

When using a random swath of 500 consecutive time samples for each object, instead of the full 2,000, we get the results 
shown in experiment 13 of Table HiH The results for the correct identification of objects fell slightly, with cylinders at 91.8%, 
frusta at 78.2%, spheres at 100%, and polygons at 85.3%. 

The first four moments, mean, variance, skew, and kurtosis, were appended as data points to the Fourier-transformed vector 
for each object. These are the first four moments of the RCS data normahzed so the largest moment has a value of one (marked 
as moments in Table Ul. These were computed for both the X-band and the S-band data sets, resulting in eight additional data 
points. The results of repeating these tests with the augmented data for the quarter-length data are as shown in experiment 14 
of Table Hill These resulted in significantly improved identification, with cylinders at 99.4%, frusta at 95.3%, spheres at 100%, 
and polygons at 95.6%. 

V. Discussion and Conclusions 

We have shown that it is possible to obtain very good classification of synthesized RCS signatures following a Fourier 
transform, with little or no other preprocessing. This reduction in computational load can have significant impact in some 
applications. This direct testing on Fourier-transformed data obviates handling the starting aspect angle of data collection; the 
additional features, such as the first four statistical moments of the RCS data (mean, variance, skew, and kurtosis), may be 
added to an input vector to improve accuracy. Table |lll] contains a summary of the results for all the numerical experiments. 
Accuracy is measured as an overall percentage correct, with ten-fold (data set 1) or twenty-fold cross-validation (data sets 2 
and 3). In an actual scenario, knowledge of the aspect angle of the object to the observer would not be known. Also, there 
is no guarantee that a full revolution of the object will be viewed. Therefore, the results from experiments 13 and 14 are 
likely closest to actual. Because the overall results from experiment 14 are better, using the moments should provide enhanced 
performance. Experiment 14 provides a start at reasonable results for an actual scenario. 
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In order to relate our SVM results with another technique, we compared them to a Bayesian network. Bayesian networks are 
a popular technique for pattern recognition and classification ifSl lfTTj . The basic concept is to combine conditional probabilities 
using Bayes' theorem. 

Donald Maurer of Johns Hopkins University, Applied Physics Laboratory, has applied the Bayesian technique to the 
identification of cones and cylinders lfT2l . His initial starting point was RCS data from which he extracted several features. 

As described in Maurer's paper, he used data of this type to build the Bayesian network. Data were processed as ten-degree 
aspect angle increments. The best overall performance by the Bayesian approach classifies cylinders from cones correctly 99% 
of the time ifTH . 

For comparison, we used Maurer's data and processed it by a SVM. The features of the data were analyzed using the kernel 
adatron. The four features that most contributed to identification in a Bayesian classifier were selected from both the cone and 
cylinder data sets lfT2l . These comprise one object each, with features from azimuths in the range to 180° in one-tenth-degree 
increments. Ten-degree swaths of each of the four features were used to compose the data for each vector, giving each vector 
a length of 400 features. 1700 vectors were created for the cone, and 1700 for the cylinder. The kernel adatron trained on 
1700 randomly selected vectors, and tested on the remaining 1700, in a twenty-fold cross-validation process. 

The results show that, statistically, they are equivalent to a random guess (52.1% correct ±6.2%). Given that the features at 
each azimuth do not necessarily correspond to those at other azimuths, this comes as no surprise. It is difficult to distinguish 
an object given a swath of features related to an azimuth; much easier by applying a Fourier transform on the data, fixing the 
data in the bins regardless of the azimuth viewed. 

There is nothing exclusive about the Bayesian success with these features; to wit, results from support vector machines on 
data could be combined with results from Bayesian beUef networks to arrive at a classification more accurate than either could 
provide. 

Our results on our FFT-preprocessed, synthesized RCS returns with the kernel adatron that was then processed by the 
SVM results in the following performance: spheres, 100%; cylinders, 99.4%; frustum, 95.3%; polygons, 95.6%. These results 
are comparable to the Bayesian approach with feature extractions. Further, our results are more comprehensive because the 
individual SVM is classifying the spheres, cylinders, frusta (more complex than cones), and polygons. The computation load 
with the SVM (including prior feature extraction of FFT) is about 2 MFLOPS. The computational load with feature extraction 
and identification by the Bayesian method is 2 GFLOPS |13|, a three-orders-of-magnitude greater computational load). The 
main problem with the work in [12J is the significant preprocessing and feature extraction from the RCS data prior to submitting 
the information to the Bayesian network. 

In conclusion, the kernel adatron, a support vector machine, classifies objects at least as well as the conventional Bayesian 
network in |12| for RCS signature classification, with far fewer computational operations required. It is especially applicable 
to time-critical applications where false negatives may have devastating consequences lfT4l ifTSl lfT6l . 
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